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When the spatial sample size is extremely large, which occurs in 
many environmental and ecological studies, operations on the large 
covariance matrix are a numerical challenge. Covariance tapering is a 
technique to alleviate the numerical challenges. Under the assumption 
that data are collected along a line in a bounded region, we investigate 
how the tapering affects the asymptotic efficiency of the maximum 
likelihood estimator (MLE) for the microergodic parameter in the 
Matern covariance function by establishing the fixed-domain asymp- 
totic distribution of the exact MLE and that of the tapered MLE. 
Our results imply that, under some conditions on the taper, the ta- 
pered MLE is asymptotically as efficient as the true MLE for the 
microergodic parameter in the Matern model. 

1. Introduction. With the advancement of technology, large amounts of 
data are routinely collected over space and/or time in many studies in en- 
vironmental monitoring, climatology, hydrology and other fields. The large 
amounts of correlated data present a great challenge to the statistical anal- 
ysis and may render some traditional statistical approaches impractical. For 
example, in the maximum likelihood or Bayesian inference, the inverse of 
an n X n covariance matrix is involved, where the sample size n may be in 
hundreds of thousands or even larger. Inverting the large covariance matrix 
repeatedly is a great computational burden if not impractical, and some 
approximation to the likelihood is necessary. 

Covariance tapering is one of the approaches to approximating the co- 
variance matrix and, therefore, the likelihood. Let the second order station- 
ary Gaussian process X(t),t G have mean and an isotropic covariance 
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function K{h;6,a'^), where cr^ is the variance of the process and 9 is the 
parameter that controls how fast the covariance function decays. Given n 
observations X„ = {X{ti), . . . ,X(t„))', the log-hkehhood is 

(1.1) Ue, a^) = log 2^- ^ log[det V„(^, a^)] - ^K[^n{0, a^T^X^, 

where V„,(0, o"^) denotes the covariance matrix of X„. The idea of tapering is 
to keep the covariances approximately unchanged at small distance lags and 
to reduce the covariances to zero at large distances. To implement the idea, 
let -f^tap be an isotropic correlation function of compact support; that is, 
-^tap(^) = if /i > 7 for some 7 > 0. Then, the tapered covariance function 
K is the product of K and Ktap, 

(1.2) k{h; e, a^) = K{h; 9, a^)Kt^p{h), 

and the tapered covariance matrix is a Hadamard product V„ = V„(0, a^) o 
T„, where T„ has the {i,j)th element as i^tapdltj — tj||). The tapered co- 
variance matrix has a high proportion of zero elements and is, therefore, 
a sparse matrix. Inverting a sparse matrix is much more efficient compu- 
tationally than inverting a regular matrix of the same dimension [see, e.g., 
Pissanetzky (1984), Gilbert, Moler and Schreiber (1992) and Davis (2006)]. 
One would use the tapered covariance function K for spatial interpolation 
and estimation as if it was the correct covariance function. For example, the 
tapered maximum likelihood estimator maximizes the corresponding log- 
likelihood 

(1.3) /n,tap(e,^T') = -^log27r - llog[det V„] - ^X;V-iX„. 

Intuitively, if the taper is sufficiently close to 1 in the neighborhood of 
the origin, the tapering would not change the behavior of the covariance 
function near the origin. It has long been known that the behavior of the co- 
variance function near the origin is most important to spatial interpolation. 
Stein (1988, 1990a, 1990b, 1999a, 1999b) has estabhshed rigorous fixed- 
domain asymptotic theory for spatial interpolation. Applying the general 
fixed-domain asymptotic theory, Furrer, Genton and Nychka (2006) showed 
that appropriate tapering does not affect the fixed-domain asymptotic mean 
square error of prediction for Matern model. 

Kaufman, Schervish and Nychka (2008) showed that the parameter in 
the Matern covariance function, which is consistently estimable under the 
fixed-domain asymptotic framework, can be estimated consistently by the 
tapered MLE with 9 fixed. However, it is unknown if the covariance tapering 
results in any loss of asymptotic efficiency. 

The main objective of this paper is to establish the asymptotic prop- 
erties, and particularly the asymptotic distribution of tapered MLE under 
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the fixed-domain asymptotic framework. We now make a few remarks about 
why we adopt the fixed-domain asymptotic framework. When the spatial do- 
main is fixed and bounded, more sample data can be obtained by sampling 
the domain increasingly densely. This results in the fixed-domain asymptotic 
framework. It is known that not all parameters in the covariance function are 
consistently estimable [e.g., Zhang (2004)] under the fixed-domain asymp- 
totic framework. Zhang and Zimmerman (2005) argued that MLEs of the 
microergodic parameters are generally consistent but those of the nonmi- 
croergodic parameters in general converge in distribution to a nondegenerate 
distribution. We refer readers to Stein (1999b), page 163, for the definition 
of microergodic parameters. In addition, Stein has established asymptotic 
results that show only the microergodic parameters affect the asymptotic 
mean square error under the fixed-domain asymptotic framework. 

However, there is another asymptotic framework, where more data are 
sampled by increasing the spatial domain. This is the increasing domain 
asymptotic framework. Under mild regularity conditions, MLEs for all pa- 
rameters are consistent and asymptotically normal [see Mardia and Mar- 
shall (1984)]. Therefore, asymptotic results are quite different under the 
two asymptotic frameworks. Given a finite sample, one has to know which 
asymptotic framework is more appropriate in order to apply any asymp- 
totic results. Zhang and Zimmerman (2005) provided some guideline on this 
through both theoretical and numerical studies. Their results show that, 
for the exponential covariance function, the fixed-domain asymptotic distri- 
bution approximates the finite sample distribution at least as well as the 
increasing domain asymptotic distribution does. More specifically, for mi- 
croergodic parameters, approximations corresponding to the two frameworks 
perform about equally well. For the nonmicroergodic parameters, the fixed- 
domain asymptotic approximation is preferable. In light of these results, we 
adopt the fixed-domain asymptotic framework in this work. 

Fixed-domain asymptotic results for estimation are difficult to derive in 
general and there are only few results in literature [see Stein (1990c), Ying 
(1991, 1993), Chen, Simpson and Ying (2000), Zhang (2004), Loh (2005) 
and Kaufman, Schervish and Nychka (2008)]. Existing asymptotic distribu- 
tions have been established only for specific models such as the exponential 
model for covariance functions [see Ying (1991, 1993) and Chen, Simpson 
and Ying (2000)] and a particular Matern model with the smoothness pa- 
rameter V = 1.5 [see Loh (2005)]. For the general Matern model, the fixed- 
domain asymptotic distribution is not available even when data are observed 
along a line. In order to evaluate the efficiency of the tapered MLE, we estab- 
lish the fixed-domain asymptotic distribution of MLE for the microergodic 
parameter in the general Matern model [Theorem 5(i)] under the assump- 
tion that data are collected along a line. This result is of interest in its own 
right, outside the context of tapering. 
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It is even more difficult to study asymptotic properties of tapered MLE. 
Indeed, we are not aware of any fixed-domain asymptotic distribution estab- 
lished for tapered MLE. For this reason, we will start with a simple model, 
the Ornstein-Uhlenbeck process along a line, which is a stationary Gaus- 
sian process with zero mean and an exponential covariance function, and 
has Markovian properties. Due to the Markovian properties, the inverse of 
the covariance matrix can be given in closed form and is a band matrix. 
Therefore, for this model, it is not necessary to approximate the likelihood 
function. However, this simple model serves as a starting point in the study 
of covariance tapering and provides insight into the more general settings, 
which we will study subsequently. 

Although spatial data are usually collected over a spatial region, there 
are situations when data are collected along lines. One example is the In- 
ternational H20 project, where measurements of meteorological data were 
collected by surface stations and aircraft along three flight paths that are 
along straight lines and transect the varied environmental conditions of the 
southern Great Plains [see Weckworth et al. (2004), LeMone et al. (2007) 
and Stassberg et al. (2008)]. Ecological data are sometimes collected along 
line transects as well. 

The main results for the Ornstein-Uhlenbeck process are presented in 
Section 2. For the microergodic parameter in the Ornstein-Uhlenbeck pro- 
cess, we establish the asymptotic distribution of tapered MLE. In Section 
3, we present the main results for a Gaussian stationary process having a 
Matern covariogram. We put all proofs in Appendices A and B. 

2. Exponential modeL We assume the underlying process X(t), t G [0, 1] 
is Gaussian that has a mean and an isotropic exponential covariogram 
K{h) = cr^ exp(— 0/i). Such a process is known as the Ornstein-Uhlenbeck 
process, which has a Markovian property that will be exploited in our proof. 

The exponential isotropic covariance function is one of the most com- 
monly used models for spatial data analysis. It follows from Ying (1991) 
and Zhang (2004) that both and 9 are not consistently estimable under 
the fixed-domain asymptotic framework, but the product a^6 is. Applying 
the fixed-domain asymptotic theory for spatial interpolation, Zhang (2004) 
showed that it is only this product, and not the individual parameters and 
6, that asymptotically affects the interpolation. Therefore, it is important 
to estimate this product well. In this section, we establish the asymptotic 
properties of the tapered MLE of this product. For simplicity of argument, 
we will maximize the likelihood function over (0,cr^) £ J = [a,b] x [w,v] 
for some constants < o < 6 and < w < v and do not require that J 
contains the true parameter value (^OjCTq)- However, we do assume that 
^qCq S {0(7^, (0, fj^) G J}; that is, there exists a pair {0,a'^) in J such that 

The following two assumptions are made throughout this section: 
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(Al) The process is observed at points tk^n £ [0, 1], k = 1, . . . ,n, with < 
ti,n < *2,n < • • • < tn^n < and suppose that nAf^^n is bounded away 
from and oo, where A^^^ = — tk-i.n, k = 2, . . . ,n. We also assume 
that tn^n 1 and ti^n — > as n — > oo. 

(A2) Kta,p{h;'-f) is an isotropic correlation function such that i^tap(/i;7) = 
if /i > 7, where 7 € (0,1) is a constant. Moreover, ii'tap(^;7) has a 
bounded second derivative in h £ (0, 1) and Kl^p{h;'y) = ch + o{h) as 
h 0+ for some constant c. 

A taper can be any correlation function with compact support, and such 
correlation functions have been studied in literature [see Wu (1995), Wend- 
land (1995, 1998) and Gneiting (1999, 2002)]. We believe that a large num- 
ber of compactly supported correlation functions satisfy assumption (A2). 
Particularly, a Wendland taper is a truncated polynomial and, therefore, 
satisfies (A2) if the degree of the polynomial is greater than 3. 

We also note that the assumption in (A2) that i^tap has a bounded second 
derivative in /i G (0,1) can be weakened, so that d? Kta.^ / dh? exists at any 
h G (0,7) as long as the first derivative exists everywhere in (0,1). The 
weakened condition will necessarily make the proof longer and, therefore, is 
not considered in this paper. 

Before we state the main results of this section, we need to introduce 
some notation that will be used throughout this paper. For sequences of real 
positive numbers a„ and a sequence of real or random numbers 6„ that may 
depend on model parameters, hn = Ou{an) if, for any n, -P(|^n| < Man) = 1, 
for some < A/ < 00, which does not depend on parameters but could be 
random. That is, hnjcLn is bounded uniformly in the parameters. Similarly, 
we write hn = Ou{an) to mean that, with a probability 1, bn/cin converges 
to uniformly in parameters. The following theorem compares the tapered 
log-likelihood function with the untapered one, and their derivatives. This 
theorem is essential to the establishment of the asymptotic properties of the 
tapered MLE to be given in the subsequent theorem. 

Theorem 1. Under the assumptions (Al) and (A2), uniformly in {O^a"^) G 
J and with Po-probability 1, 

(2.1) InMG.cy^) = ln{0,a^) + Ou{n^'^), 

(2.2) ^/„„tap(e,fT') = -Q-^ln{e,CJ^)+Ou{n^'^), 

where Pq is the probability measure corresponding to the true parameter val- 
ues (7q,9o. 

The next theorem establishes the strong consistency and the asymptotic 
distribution of the tapered MLE. Comparing the asymptotic distribution of 
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MLE of (Tq^o ill Ying (1993) and that in the fohowing theorem, we see that 
the tapered MLE is asymptoticahy equahy efficient. 

Theorem 2. A ssume (Al) and (A2) hold, and let (^n, tap; '^n tap) i^mx- 
imize the tapered likelihood function over {6,a'^) G J. Then, as n— >oo, 

(2-3) PoQim_4,tapO-^,tap = ^0'7o) =1, 

(2.4) V^(^n,tap<T^,tap " Oo^l) ^ iV(0, 2(0oCTo')'), 

where Pq is the probability measure corresponding to the true parameter val- 
ues (Tq,9o. 

3. General Matern model. In this section, we will focus on studying the 
asymptotics of tapered MLE for a general Matern model. We assume the 
underlying process is stationary with mean and the following isotropic 
Matern covariogram: 

(3.1) K{h-a\e,v) = ^^^}C,{eh), h>0, 

with unknown a^,9 and known u, where /Cjy is the modified Bessel function 
of order v [see Abramowitz and Stegun (1967), pages 375 and 376], cj^ is the 
covariance parameter, 6 is the scale parameter and i' is the smoothness pa- 
rameter. Further, assume that the process is observed at n sites ti,t2, ■ ■ ■ ,tn 
in a bounded interval D C ffi, and write X„ = (X(ti), . . . , X(t„))'. Zhang 
(2004) noted that neither o"^ or 6 is consistently estimable under the fixed- 
domain asymptotic framework, but the quantity is consistently es- 
timable. Furthermore, this consistently estimable quantity is more impor- 
tant to prediction than the parameters o"^ and 6. 

The primary focus of this section is to establish the asymptotic distribu- 
tion of the estimators for a^O^'^. This is a more difficult problem than in the 
exponential case, and we cope with it by considering an easy version of the 
problem. Following Zhang (2004), we fix 6 at an arbitrarily chosen value 6i 
and consider the following estimators: 

(3.2) ^2 =ArgMax/„(0i,cT2), 

(3.3) = ArgMax/„,tap(ei,fT^), 

where ln{0i,(T'^) and /n,tap(^i) c^) are the log-likelihood function and the 
tapered log-likelihood function, respectively. 

We make the following assumption on the spectral density of the taper 
^tapih)- Similar conditions were used in Furrer, Genton and Nychka (2006) 
and Kaufman, Schervish and Nychka (2008). Our condition here is stronger, 
and it is necessary for our approach to deriving the asymptotic distribution 
of tapered MLE: 
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(A3) The spectral density of the taper, denoted by /tap (A), satisfies for some 
constant e > max{l/2, 1 — u} and < M < oo 

(3.4) /tap(A) < 



(l + A2)^+V2+e- 

We note that taper condition (3.4) is satisfied by some weh-known tapers. 
For example, Wendland tapers (1995, 1998) have isotropic spectral den- 
sities that are continuous and satisfy gd,k{^) < Af(l + ^2^-d/2-fc-i/2 
some constant M, where d is the dimension of the domain {d= 1 in this 
work). Therefore, condition (3.4) is satisfied if A; > max{l/2, z^}. Furrer, 

Genton and Nychka (2006) gave explicit tail limits for two Wendland ta- 
pers i^i(/i;7) = (l-^)t(l + 4^),7>0 and K2{h-n) = {1 - + 6^ + 

^^),7 > (x+ = max{0,x}), and showed that X'^giiX) 120/(7r7^) and 

A^fl'2(A) — > 17920/(7r7^), as A— >oo, where gi is the spectral density of Ki 
{i = 1,2). Therefore, condition (3.4) holds if < 1 for taper Ki and v <2 
for taper K2- 

One important probabilistic tool we will extensively use is the equiva- 
lence of probability measures. The assumption (A3) implies that the ta- 
pered covariance function specifies a Gaussian measure that is equivalent to 
the Gaussian measure specified by the true covariance function [Kaufman, 
Schervish and Nychka (2008)]. It readily follows that d''^t^pdi'^ is a strongly 
consistent estimator of ctqOq'^ [e.g., Kaufman, Schervish and Nychka (2008)]. 

The main results in this section are the following three theorems. The 
next theorem is a general result about two equivalent Gaussian measures 
and is not restricted to the case of covariance tapering. It will be used to 
prove the other two theorems. 

Theorem 3. Let X{t),t €M. be a stationary Gaussian process having 
mean zero and an isotropic covariogram Kj and a continuous spectral density 
fj under measure Pj,j = 0, 1. Assume the process is observed at ti,t2, ■ ■ ■ in 
a bounded interval D, and let X„ = (X(ti), . . . ,X[tn))' ■ If 

(3.5) liminf/o(A)|Ari >0 and limsup/o(A)|Ar'i < cx) 
and 

(3.6) /i(A) = ^{4tv- 1 = 0(1 Ar^2), A^oo /or some r-2 > 1, 

/o(A) 

then 

(3.7) ^o(xUVi-i - Vo;^)x„) = 0(1), 

where Vj^n is the covariance matrix of X„ given by the covariogram Kj, 
j = 0, 1, and Eq is the expectation with respect to Pq. 
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We note that condition (3.6) is stronger than the equivalence of the two 
Gaussian measures corresponding to the two spectral densities /o and /i. 
Indeed, under condition (3.5), the two Gaussian measures are equivalent 
if (3.6) holds for some r2 > 1/2. However, equivalence alone cannot imply 

(3.7) , and we need stronger conditions than the equivalence of two measures. 
We will show later that condition (A3) implies that (3.6) holds, for some 
r2 > 1, if /o and fi represent the spectral densities of the true and tapered 
covariograms, respectively. 

Theorem 4. Suppose condition (A3) is satisfied, and the underlying 
process is stationary Gaussian having a mean and a Matern covariance 
function, and the sampling locations {ti,t2, ■ ■ ■} are from a bounded interval. 
Then, for any fixed 6i > 0, vuith Po-probability 1, uniformly in cx^ G ['u^,''^], 

(3.8) /n,tap(ei, <t') = UOi,a^) + 0,(1), 

(3.9) _/„ ,^p(0,, ^2) ^ _;^(^^^ ^2) ^ 0^(1)^ 

where Pq is the probability measure corresponding to the true parameter val- 
ues ctqjOo, v. 

Next, we give the asymptotic distributions for both exact MLE and ta- 
pered MLE of the consistently estimable quantity 

Theorem 5. Assume that the underlying process is stationary Gaussian 
having a mean and a Matern covariance function with a known smooth- 
ness parameter v , and the sampling locations {ti,t2, ■ ■ ■} are from a bounded 
interval: 

(i) For any fixed 9i, 

(3.10) V^ialer - crXn ^ iV(0, 2(^2^)')- 

(ii) In addition, if the taper satisfies condition (A3), 

(3.11) Mai,.^er - ^ m ^i^xn'). 

Theorem 5 implies that the covariance tapering does not reduce the 
asymptotic efficiency for the Matern model. In this paper, we are not able to 
show that (3.11) remains true if 6i is replaced by the MLE of 6. Therefore, 
the results in this theorem are not as strong as those in Theorem 2. More 
work will need to be done to extend Theorem 2 to the general Matern case. 

We note that asymptotic distributional results about the microergodic 
parameter in the general Matern class have not appeared in litera- 

ture. Theorem 5(i) is the first of such results, and its proof requires a novel 
approach. 
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4. Discussion. There are some open problems for future research. First, 
for the Matern model, the estimator of aH^^ is constructed by fixing 6 at 
an arbitrary value. For a finite sample, common practice is to also estimate 
9. It is an interesting question to see if Theorem 5 still holds for the MLE 
a 9 and the tapered MLE 

^n,ta.p^n^tap' Conjecture is that Theorem 5 

can be extended to this case. 

The main results in Sections 2 and 3 are for the processes with one- 
dimensional index. It is a more interesting problem to study the high- 
dimensional case. However, our techniques in Section 3 cannot be extended 
to obtain analogous asymptotic distribution in the high-dimensional case. 
For example, for a d-dimensional process, we would need (3.6) to hold for 
some r2 > d in order for the proof to carry through. Unfortunately, for the 
Matern model, (3.6) cannot hold for any r2 > 2. The high-dimensional case 
calls for new techniques for establishing asymptotic distributions. A referee 
suggested letting the bandwidth 7 vary and go to as n increases to 00. This 
is a natural scheme in the fixed-domain asymptotic framework. We believe 
that everything in Section 2 carries through if the bandwidth goes to not 
too fast. When the bandwidth of the taper depends on n, it is not obvious if 
our techniques in Section 3 still apply, because the properties of equivalence 
of probability measures are no longer directly applicable. 

APPENDIX A: PROOFS FOR SECTION 2 

In the sequel, we often suppress n in the subscripts. For example, write 
tk = tk,n: = ^k,n- We will need three lemmas for the proofs of the theo- 
rems in Section 2. 



Lemma 1. Let X{t) be the Gaussian Ornstein-Uhlenbeck process, and 
assume (Al) holds. Denote E{X(ti)\X{tj),j / i) = -J2jjLibij,n{^)^ih)> 1 ^ 
i <n, Yai {X{ti)\X{tj),j ^i) = di^n{9,cr'^), which is written as di for short. 
Then, for 1 <i <n, 



Jii-i 



AO) 



(A.l) 



bii+l,n{9) 



I _ g-20(Ai+A,+ i) 
g-6lA,+i^ _ g-26lAi 



(A.2) 



I _ g-2e(Ai+A,+ i) ' 

buAO) = -e-'^\ bnn-iAO) = -e~'^\ 
bij,ni9) = for\i-j\>l. 
In addition, uniformly in {9, a^) £ J, 1 < i <n, I <j <n, 

di = 2a^9A2 + oJ^), dn = 2a^9An + 0^. „ 
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f 1 ^ 


Ou( 












Ou( 









Proof. Note that E{X{ti)\X{tj),j ^ i) = -Y.j+ihj,n{0)X{tj),\< i < 
n if and only if 

Cowl X{ti) + Y,hik,n{0)X{tk),X{tj) \ = {) for any j/i,i = l,...,n. 

We therefore prove (A.l) and (A. 2) by verifying that 

Cov(X(ii) + hi,i-iX{ti-i) + hi,i+iX{ti+i),X{tj)) = 

(A.5) 

for any j ^i, 

where we let 6io = hn,n+i = 0. For i = 1 or n, (A.5) readily follows the sta- 
tionarity and the Markovian property of the Ornstein-Uhlenbeck process. 
For \ <i <n, (A.5) holds, because, if j > i + 1, the left-hand side of (A.5) 
equals 

' 1 _ e-2e(Ai+A,+i) 

g-0Ai+iM _ g-26»Ai 



1 _ g-2e(Ai+Ai+i) 

which is zero. We can get the similar expression when j < i — 1. There- 
fore, (A.5) is proved. Since di = E{X{ti) + 6j,i_iX(ti_i) + hi^i+iX{ti+i)f , 
straightforward calculation yields 

= cj\l- e-2^^^), d„ = cj\l- e-2^^"), 
(l_e-2eA,wi_g-20A,+i) 
^' = ^ 1 _ e-2.(A,+A.,0 ' 1<^<"- 

Then, (A.3) follows the Taylor expansion. To establish the properties of 
the derivatives in (A.4), we repeatedly use the Taylor expansion. Here we 
only provide a proof for 6-j„(6') = Ou(l/n^) for 1 < i <n, since all other 
derivatives can be proved similarly. Since 6jj = if |i — j| > 1, we only need 
to consider j = i — 1 or i + 1. 
Write the derivative 



-29(Ai+A,+ l)^2 
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where 

A 



(A.6) 



Note that 



_ (_e-^^- + e-^^»-2^^'+i)2(Ai + A,+i)e-2^(^'+^»+i) 
Aie-''^^ - (A, + 2Ai+i)(e-^^»-2^^^+i - e-3eA.-2eA,+i) 



1 



1 



1 _ g-2e(A,+A,+i) 20(Ai + Ai+i) 

is uniformly bounded and n(Aj + Aj+i) is bounded away from and oo 
by assumption (Al). Hence, 1/(1 — e~2S(^»+'^»+i)) = 0(l/n), and it suffices 
to show that A is Oui^/n^)- Using, again, the fact that Aj = 0„(l/n) and 
applying the Taylor expansion, we get 



A,;e" 



A, + (1/2)02^3 ^0„(l/n 



-Aie" 



- (Ai + 2Ai+i)(e-''^»-2^^»+i - e-3eA,~2eA,+i) 
= -261 Af + 46|2Af + 126'2A^Ai+i 

- 4MiAi+i + 802A, Af+i + 0„(l/n4), 
-39A,-40A,+i ^ ^ 3^^2 ^ 40A.Ai+i - (9/2)^2^3 

- 12^2^2^,+, _ 802a,a2+i + 0„(l/n^). 

All the terms except Ou(l/n^) are canceled out. Therefore, A = Oui^/n^) 
and b[^_i ^{9) = ©^(l/n^). Similarly, we can show b[^_^_i = ©^(l/n^). □ 

We now introduce the following notations. Let 0„ denote a matrix of 
which the elements are Ou{^/n) except those in the first and last rows, which 
are uniformly bounded; that is, 0«(1). Denote, by On, the matrix whose 
(i,j)th element is Om(1) if i = 1 or n or z = j, and is Ou{^/n) otherwise. 
Therefore, 

/ Ou{l) ••• 0.(1) \ 
Ou{l/n) ••• Ou{l/n) 



Or 



Ouil/n) 
\ Ouil) 

(Ou{l) 



Ou{l/n) 
Ou{l) J 

Ou{l) 



On(l) 
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Lemma 2. Under assumptions (Al) and (A2), uniformly in 6 £ [a,b]: 

(i) V-i(V„oT„)=I„ + 0„, V-i^ = 0„, 

(ii) ^(v-(v„oT„)) = 6„, |(v-^)=6„,, 

(iii) l<det(V-i(V„oT„,)) = 0„(l), (V-i(V„,oT„))-^ = I„ + 6„, 
where I„ is the n x n identity matrix. 

From the definitions of 0„ and 0„,, we have 
(A.7) 6n6n = On, 6„6„ = 6„, 6„6„ = 6,. 

Then, Lemma 2(i) and (ii) imply 

(A.8) ^(v-i(V„oT0)-i=O„. 

Proof of Lemma 2. We can assume = 1 without loss of any gen- 
erality, because all quantities in the lemma do not depend on o"^. We will 
repeatedly use Lemma 1 and particularly the fact that V^^ is a band ma- 
trix. The proof involves tedious computation, and we will keep a balance 
between brevity and clarity. 

Several quantities in the lemma are of the form V~"'^(V„ o Q), where Q is 
an n X re matrix whose (i,j)th element is Q{ti — tj) for some even function 
Q{t) that has a bounded second derivative on [—1,0) U (0,1]. If the limits 
of the derivative g'{0+) = limt„»+ g'{t) and q'{0—) = limf_>o- Q'it) exist and 
are finite, we show now 

V;:^(V„ o Q) = ^(0)1, + {g'{0+) - q'{0-)) 

(A.9) 

xdiag{0„(l),...,0„(l)} + 0„, 

where diag(0„(l), . . . ,Ou{^)) denotes a diagonal re x re matrix with bounded 
elements. There are immediate corollaries from (A.9). First, it implies V~^(V„o 
Q) = On. Second, by taking Q{t) = — \t\, we get Y~^{dYn/d9) = On- Last, if 
g{t)=Kt^p{\t\), then g' (0+) = g' {0-) and V-i(V„ o T„) = L„ + 0„ because 
i^tap(O) = 1. 

To prove (A.9), let uJij denote the (i,j)th element of V~^(V„o Q). Let 
bij be defined in Lemma 1. Hereafter, the parameter and subscript re are 
suppressed. Then, it is well known that the (i, j)th element of is bij/di. 
Write bij = if j < 1 or j > re and tQ = ti,tn+i = tn- Since by = if \i — j\ > 1, 

(A.IO) ^^3=d;^ b^kK{\tk-tj\)g{tk-tJ). 

k=i-l 
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For any i > j, and k = i — 1 or i + 1, we have tk — tj > 0. Hence, the Taylor 
theorem imphes 

Q{tk - tj) = g{ti - tj) + g'iU - tj){tk - ti) + Q"{ti - tj + C{tk - tj)){tk - ti)V2, 

for some ^ G (0, 1). Since g has a bounded second derivative on (0, 1), and 
tk — ti = 0{l/n), we have 

(A.ll) Q{tk - tj) = Q{ti - tj) + g'iti - tj){tk - ti) + 0(l/n2). 
Then, 

i+l 

i^ij = g{ti - tj) hikK{tk-tj) 

k=i-l 

(A.12) 

•i+i 

+ dT^g'{ti-tj) J2 K{tk-tj){tk-ti)bik + Ouil/n), 

k=i-l 

where we have used dl[^ = Ou{n). Note that Z^Iii-i bikK{ti^ — tj) is the 
(i, j) element of D~^B„Vji = I„. Hence, the first summand in (A.12) equals 
g{Q)l{i=j}. 

Similar to the establishment of (A.ll), we can show 

K{tk - tj) = K{ti - tj) + K'{ti - tj){tk - ti) + Ou{l/n^). 
It follows that, for i > j, 

(A.13) u;ij = dr^g\ti-tj)K{ti-tj) ^ {tk - ti)b^k + Ou{l/n). 

k=i-l 

By utilizing the explicit expressions of bij given in Lemma 1, we can show 
(tk — ti)bik = 6i,i+iAj+i — bi^i-iAi 

k=i-l 

(A.14) 

Ouil/n?), ifl<i<n, 



Ou{^/n), if i = 1 or n. 
Then, for i> j 

Om(1), if i = 1 or n. 



^^■^^^ \Ou{l/n), ifl<i<n. 

In view of the fact that g is an even function, we can show, similarly, that 
(A.15) holds for i <j. 

Now, let us consider wa. First, note that 

(A.16) git,.i - ti) = g{0) + g'{0-){ti_i - t,) + 0{l/v?), 

(A.17) g{ti+i - ti) = g{Q) + g'{Q+){ti+i - ti) + 0{l/v?). 
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Since d-^ Eit^i-i hkKitk - t.) = 1, 

i+l 

^ii = ^ikK{tk - ti)g{tk - ti) 

k=i-l 

= q{Q) + d-^{h^i^iK{ti^i - U)Q'{0-){t,^i - ti) 

+ bi,i+iK{ti+i - ti)Q'{0+){ti+i - ti)} + 0„(l/n2). 
Since K{h) = K{0) + K'{0)h + o,„(/i) as /i ^ 0, 

cuii = g{0) + K{0)d-^{h,,.ig'{0-){ti.i - ti) + bi^i+ig' {0+){ti+i - ti)} 
+ Ou{l/n^), 
which can be rewritten as 

ujn = g{0) + g'{0-)K{0)d;^ ^ {tk-ti)bik 

k=i-l 

(A.18) 

+ [£''(0+) - g'{Q-)]K{Q)d-%i+i^,+i + 0„(lV). 

Then, (A. 9) follows from (A. 14), (A. 15) and (A.18). (i) is therefore proved. 

To prove (ii), we will use the following well-known fact [see, e.g., Graybill 
(1983), pages 357 and 358]: 

fAl9) Av-i = -V-i^V-i 

Applying Lemma 2(i), 

|(V-i(V„ o T.)) = -V-i^V-HV„ o T„) + o T, 

-_v-i^(i +6 )+v-i^^^" 



de ' \ de 

= v-^(^o(T„-j„))+o„, 

which is clearly from (A. 9) by taking g{t) = — |t|(iftap(|i|) — 1) that is 
differentiable at 0, where Jn is a matrix of all I's. 

Next, we will show ^(V~^^^) = 0„ similarly. Write 

(A 20) ^ fv-i^^ = _v-i^V-i^ + V-i^ 

^ ' de\ de ) de " 50 " • 

By (i), the first term on the right-hand side of (A.20) is 0.„0„ = 0^, and the 
second term is 6„, because V~-^ ^ge^" = ^n^^n ° Q) with g{t) = t"^, which 
has a continuous second derivative so that (A. 9) applies. This completes the 
proof of Lemma 2(ii). 
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Let A„ = V-i(V„oT„) and ttij denote the (i,j)th element of A„. We 
now apply a series of column operations, so that A„ becomes In + fin and 
each of the operations retains the determinant of A„, where ri„ is a matrix 
whose elements are bounded by M/n for some constant M not depending 
on 0'^ that is, l^n(^;j) — ^/^- ^Ve have shown that A^ = In + On, where 
elements of On are 0„(l/n) except those that are on the first and last rows 
that are bounded. We can subtract from the j'th column the first column 
multiplied by the (l,j)th element of An, 2 <j <n. Then, all elements in 
the first row are Ou{^/n), except the (1, l)th element, which is 1 + Ou(l/n) 
and remains unchanged throughout the operations. Similarly, we can reduce 
the elements in the last row to Ou(l/n) except the last (?i,n)th element. 
Applying the Hadamard inequality [Bellman (1970), page 130], we can show 
there exists some constant M such that 

det(An) = det(In + fin) < ((1 + M/fif + {n- l){M/nff'^, 

which is bounded. 

To show A-i = In + 6 n, we first note that by Oppenheim's inequal- 
ity [Mirsky (1955), page 421], which yields the inequality for the deter- 
minant of Hadamard product of positive definite matrices, det(Vn o Tn) > 
det(Vn) ni<i<n tii where tu is the diagonal element of T„. Therefore, det(An) > 
1. We only need to show that the («,j)th cofactor 

Aij = det(An)l{i=j} + Ou{l/n) + l|j=i or j=n}Ou{X)- 

Similar to proving det(An) = 0^(1)) we can show all the (n — 1) by (n — 1) 
cofactors are also On(l)- In addition, Aij = 0„(l/n) for 1 < j < n,i j 
since it has one row of elements On(l/ra) and replacing that row with 0^(1) 
would yield a bounded determinant. To complete proof of the lemma, it 
remains to show An = det(An) -|-Ou(l/ra), 1 <i <n, which is true by Laplace 
expansion det(An) = (1 + Ou{l/n))Aii + J2j:j^iO-ij^ij and observing that 
T.j-.j^iO'ijAij = Ou{l/n), for 1 < i < n. □ 

Lemma 3. For any 6£ [a,b], let Sn{0), n = 1,2,..., be a sequence of 
random variables such that E{SniO)) = Ou{{iognY) , E[Sn{0) — ESn{0)]^ = 
Ouii^ognY) uniformly in for some constant r > 0. Assume that, with prob- 
ability one, Sn{d) is difjerentiable with respect to 9 and 8^(0) = Ou(n^(logn)^) 
uniformly in 9. Then, 

sup \Sn{d)\ = o^n^^"^) a.s. 

9G[a,b] 

For the ease of notation, we will suppress the dependence of any quantity 
on n and parameters [e.g., 6ij,n(^) = bij, di^nid^a"^) = di], wherever confusion 
does not arise, throughout the rest of the paper. 
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Proof of Lemma 3. Let a = 60 < Oi < ■ ■ ■ < Om,, = b partition [a,b] 
into intervals of equal length, where M„ is the integer part of n^/^+" for 
some < a < 1/14. Then, 

(A.21) sup|S„(e)|< max + max sup - 

9G[a,b] l<k<M„ l<fc<A/n6»G[6»fc_i,0fe] 

Because there exists constant C > such that the sixth central moment of 
SniO) is uniformly bounded by C(log7i)'", 



P( max \Sn{ek)-E{Snm)\>7i^ 

\l<k<Mn 

< P{\Sn{9k) - E{Sn{ek))\ > 



< M, 



k=l 

C{lognY C(logn) 



Since 3/2 — 7a > 1 with a < 1/14, it follows from Borel-Cantelli lemma that 
max \Sn{ek)-E{Sn{ek))\=0{n^/^-'') a.s. 

l<k<M„ 

Since E{Sn{d)) = 0{{lognY) uniformly in 6, and 

.J^^^^ l'S'n(6'fe)| < inax \Sni0k)- E{Sni0k))\+ niax \E{Sni9k))\, 



then 



(A.22) max \Sn{0k)\=O{n''^~'^) a.s. 

l<fc<Af„ 

On the other hand, for 9 G [9^-1, dk\-, 
\Sn{9k) - < sup \S'M{9k - = 0„(n^/2-"(logn)^) a.s. 

Therefore, 



(A. 23) max sup \Sn{9k)-Sn{9)\=o{n^'^) a.s. 

The proof is completed by combining (A.21), (A.22) and (A. 23). □ 



Proof of Theorem 1. Recah that the tapered and untapered log- 
likelihoods are given by (1.3) and (1.1), respectively. The proof of (2.1) 
consists of direct comparisons of the log determinants and the two quadratic 
forms. First, Lemma 2(iii) implies that 



(A.24) log[det(V„ o T„)] = log[det(V„)] + 0„(1). 
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Define 11^(9) = (V-i(V„ o T„))-i - I„. Then, H„ = 6^ by Lemma 2(m). 
Because 

(A.25) (V„oT„)-i=V-i + H„V-\ 

(A.26) X'„(V„ o T„)-iX„ = X;,V;^iX„ + X^H^V-^X,. 

Proof of (2.1) would be completed if, uniformly in (^,0"^), with probabihty 
1, 

(A.27) X'„H„V-iX„ = o„(ni/2). 

We will apply Lemma 3 to prove (A.27). Define 

Sn{9) = (T^X^H„V„^X„, 

and note that Sn{0) depends on 9 but not on a^. In view of symmetry of 
H„V~^ by (A.25), we can write 

(A.28) EoSn{9) = a2trace{H„V-iV„,o}, 

where hereafter in the proof the expectation is evaluated under the true pa- 
rameter (Tq and 6*0, and V„^o = ^^(6*0, o"^). The rth cumulant of X^H.„V~^X^ 

(A.29) ^c, = 2^-i(r- l)!trace{H„V-iV„,or, 

r = 1, 2, . . . [see Searle (1971), Theorem 1, page 55]. 
Next, we show that 

(A.30) V-\9,a^)Vni9o, ag) = Ou{l)In + 6„. 

Then, it follows from (A.28)-(A.30) and (A. 7) that the first moment and 
the sixth central moment of Sn{9) are uniformly bounded, because the sixth 
central moment of Sn{9) is kq + I5K4K2 + IOK3 + 15^1, which is uniformly 
bounded because all of the four cumulants involved are uniformly bounded. 

We now give explicit expression for the elements of \^~^{9,a^) based on 
Lemma 1 and the following well-known result [e.g., Ripley (1981), page 89]: 

(A.31) Y-H0,^^)=^n\0,a^)Bn{9), 

where B„(6') = (6jj,n(6'))i<jj<n and D„(6',cj2) = diag{(ii(6', cj^), f = 1,. . .,n}, 
in which bij^n{9), di{9,a'^) are defined as in Lemma 1 and bii^n{9) = 1. 

For brevity, we drop the parameters in the matrices and write B„^o = 
B„(0o) and D„^o = D„(0O) ctq)- Decompose into 

= D-iB„,o + D-i(B„,o - B„) = Ai + A2. 

Then, 
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and the diagonals of D~^D„^o converge uniformly to {alOQ) / {a'^O) by (A.3). 
Therefore, 

(A.32) AiV„,,o = diag(04l), . . . , 0„(1)). 

In addition, 

(A. 33) A2V„ = On uniformly in S [a, h]. 

Indeed, the absolute value of the (i, j)th element of A2V„ qi ^ <i <n \s 

n 
k=l 

\k-i\<l 

1 



n 



where the last equality follows from (A.3), (A. 4) and the Taylor theorem. 
Similarly, we can show the elements on the first and last rows are Otj(l). 
Hence, (A. 30) follows from (A.32), (A. 33) immediately. Last, note that 
■^Sn{0) = Ouin^) by Lemma 2. The conditions of Lemma 3 are satisfied. 
Therefore, 

(A.34) sup Sn[e) = o{n^l'^), 

ee[a,6] 

which implies (A. 27). 

We have now proved (2.1). (2.2) can be proved similarly, and the remain- 
ing proof will be brief. The derivatives of the log likelihood functions can be 
written as 

(A.35) §QUO,a^) = -tracejv-i^} +X;V-i^V-iX„, 

^^n,tap(0, ^2) = - trace|(V„ o T„)-^ (-^ o T„ j | 



(A.36) 



+ X;(V„ o T„)"i o T„, ) (V„ o T^r'X^ 



We first show that the two traces differ by 0^(1) . Write A„ = V„ ^ (V„ o T„) . 
It is straightforward to verify 

(V. o T„)-i ( ^ o T„,) = A-i V„-i ^ A. + A-i 



V 80 " J " 39 " " 39 ■ 

Then, 

tracej (V„ o T„)-i o | = tracej V^T^^j + trace (a;;'^ 
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where the second trace in the right-hand side is clearly uniformly bounded 
by Lemma 2. Similarly, we can write 

(V„ o Tn)-' o T„) (V„ o T„)-i 

dd n * rt 

for some matrix W„, which is O^. Using the exact same technique for 
deriving (A. 27), we can show 

X;W„V-iX„ = o„(n^/2)_ 
The proof is complete. □ 

Proof of Theorem 2. First, for (2.3), it suffices to show that, for any 
e>0, 

(A.37) Po( inf {l^^^ ^e,a^)-lnMpiO,c7^)}-^^)=l, 

where (^,o"^) G J can be any fixed vector such that Ba"^ = QqOq. 

Ying (1991) has shown (A.37) for the log likelihood function ln{9,a'^). 
More specifically, Ying (1991) showed that, uniformly in {9,(t'^) G J and 
|6'(T^ — 9a'^\ > e, with probability 1, 

ln{0,a^) - ln{0, a^) >r]n + 0„(n^/2+a) any a > 

[see the proof of Theorem 1 in Ying (1991), page 289]. Then, (A.37) follows 
because of (2.1) in Theorem 1. 

Similarly, we can show (2.4) by using (2.2) and some asymptotic results 
in Ying (1991). We can write [see (3.10) and (3.11) in Ying (1991), page 
291] 

k=2 

where 

X(ifc)-e-^o^'=X(tfe_i) 



Wi 



k.n 



Note that Wk^n depends only on the true parameters and are i.i.d. A^(0, 1) 
for A; = 1, . . . , re. 

Then, for any {9, cr^) G J, we have 



= ^0^0 - 1) - "(^'^ - ^o^o) + Ou{n 

k=2 



20 J. DU, H. ZHANG AND V. S. MANDREKAR 

by Theorem 1. In particular, for {6,a^) = {6n,ta.p,^n,ta.p^' left-hand side 
is zero. Therefore, we obtain 

n 
k=2 

Since k = 1, . . . ,n, are i.i.d. Xij we have 

n 
k=2 

The proof is complete. □ 

APPENDIX B: PROOFS FOR SECTION 3 

We will employ some known properties of equivalent Gaussian measures 
and will refer to Ibragimov and Rozanov (1978) frequently. Two measures 
Pj,j = 0, 1 are equivalent if they are absolutely continuous with respect to 
each other. Let ^(t), t £ be Gaussian stationary under the two equivalent 
measures Pj, where D \s a, bounded subset in for some d>\. Let X„ = 
(X(ti), . . . ,X(t„))' denote the observations in D, and let Pj{xi, . . . ,Xn) de- 
note the density function of X„ under measure Pj for j = 0,1. Then, the 
Radon-Nikodym derivative pn = pi(X.n) / PoO^n) has a limit p with Pq- 
probability 1. In addition, 

Po(0<p<oo) = 1, lim £;o(log/3n) = £^o(logp) and 

n^oo 

(B.l) 

— oo < E'ollogp) < oo. 

We refer the readers to Section III. 2.1 of Ibragimov and Rozanov (1978) for 
these results. It follows that the log-likelihood ratio and its expectation are 
all bounded. With Po-probability 1, 

log /On = ^n,l(Xn) - /„,o(X„) 

(B.3) £„(logp„) = -ilog^l^ - ^Eo{X„{Vli - V„-i)X„) = 0(1), 

The difference of these two equations yields 

(B.4) Ki^l'n - Vo";n)Xn " i^o (X'„ ( V^;,;, - V^^JX^) = 0(1) a.s. 
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Before we proceed with the proof of the main results in Section 3, we wiU 
estabhsh the following lemmas. For two functions a{x),b{x), we write a{x) x 
b{x),x — > oo if — oo < limmix^oo o-ix) /b{x) < limsup^_^a{x)/b{x) < oo. 

Lemma 4. Let /i(A) be the spectral density corresponding to isotropic 
Matern covariogram K{h]ai,9i) and /i(A) be the spectral density corre- 
sponding to the tapered covariance function K{h;ai,6i) = ct^, ^i)i^tap(^)- 
Under condition (A3), there exists r > 1 such that 

(B.5) ^^^^^ = 0{\^r) as \X\^oo. 

Proof. Using the fact that Fourier transform of product of two func- 
tions is the convolution of their Fourier transforms, we have 

(B.6) /i(A) = / /i(x)/tap(A - x)dx, 

where /tap is the spectral density corresponding to i^tap- It is seen that 
/i(A)//i(A) does not depend on af so that we can assume without loss of 
generality that af = 1. It suffices to consider the case that A > 0, because 
/i(A) is symmetric about A = 0. Using /ig/tap(A — x)dx = 1 and breaking 
down these integrals over intervals (— cx), A — A'"'] U [A -)- A'"', +00) and (A — 
A*^, A + A'^) for any k G (0, 1), we have 

A (A) ^ _ /|A-x|>Afc /i(a;)/tap(A - x) dx r 
/i(A) /i(A) y|A-x|>A'= 

/|A-x|<Afc(/l(^) - /l(^))/tap(A - x) dx 

= ri + r2 + T3. 

By condition (A3), we have 

"^^"-(i + a4+V2+./,Ja)/^^(")^"- 

The Matern spectral density has a closed form 
In addition, / /i(A) dA = cr^ = 1 is the variance. Then, 



/tap (A - x)dx 



(B.8) |Ti| < 



cef^(l + A2fc)'^+V2+£- 
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Similarly, 

M 

(B.9) iTal < 



+ e)A2'=(^+^)' 

Since e > 1/2 and v + e>\ by condition (A3), we can choose k to be suffi- 
ciently close to 1, so that both Ti and T2 are 0(A~^) for some r > 1. 
To bound T3, write, for some ^ between A and x, 

h{x) - MX) = f[{X){x - A) + /{'(O^^^. 



Then, 



T^Yrf/^lA) / (x-A)/tap(A-x)dx 

+ / /('(6^^^/tap(A-x)dx). 



'|A-x|<Afc 

The first term is because the integrand is odd. For the second term, note 



^ c0f(2z. + 3)2 



Therefore, if A is sufficiently large, for ^ lying between x and A, where x is 
in the interval |x — A| < A'^, 

cgf(2z. + 3)2 2cgf(2z. + 3)2 

U <^ Jl _ ^ _ ^fc)2).+3/2 - ^2.+3 

Then, 

Condition (A3) implies that 2;2/tap(x) is integrable. Then, T3 = 0{\"^). The 
proof is complete. □ 

Lemma 5. For any real number r > 0, there exists ^r(A) such that 
(B.IO) Cr{X)= J Cr{t)exp{-iXt)dt, 0<|Cr(A)|2>^|A^^ A^oo, 
where Cr{t) is square integrable and has a compact support. 
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Proof. We only need to show the case < r < 1, because the product 
of any functions of the type given by (B.IO) belongs to this type, due to the 
fact that the Fourier transform of convolution coincides with the product of 
Fourier transforms. Let 



We will show that ^r(A) satisfies (B.IO). We only need to prove it for A > 0, 
because ^r(A) is symmetric about A = and ^r(O) > 0. Let u = Xt. We can 
write 

er(A) = 2A-''/2 cos{u)vrl^-^du. 





Then, Cr(A)^ >i l^^l"'' as A — > +oo, because cos{u)u^'/'^~'^ is integrable for < 
r < 1. 

Next, we will show ^^(A) > for any A > 0. It suffices to show 

(B.ll) y(A)= / cos{u)u-^ du>Q, 

Jo 

where (5 = 1 -r/2 G [1/2, 1). Note that y'{X) = cos{X)X^^ and y"{X) = - sin(A) x 
A~^ — (5cos(A)A^'^^^. Therefore, the minimum points are {Ikn + 37r/2,fc = 
0, 1, . . .}. So, we only need to show y{2k-K + 37r/2) > 0, A: = 0, 1, . . . by induc- 
tion. First, using monotonicity of cos(u), we have 

/37r\ /■7r/4 e-K/2 

y[ — )= / cos{u)u'~^ du + / cos{u)u~^ du 

V 2 / Jo Jn/4 

f3TT/2 



(B.12) 



+ / cos(ti)n du + cos{u)u du 

J-it/2 Jn 

(7r/4)i-^ 1 / V2\ 1 1 



1-6 {-K/2Y V 2 y {ti/2Y t 

\f2 ^ 2 ( \f2\ [2 1 
2 vr V 2 J y IT Wtt 



Next, suppose y{2{k — 1)tt + 37r/2) > 0, for k>l, then 
y(2fe7r + 37r/2) = y{2{k - l)7r + 37r/2) 

+ / cos{u)u'~^ du+ j cos (u) u~ ^ du 

J2fc7r-7r/2 J2fc7r-|-7r/2 



r2kTT+TT/2 

2/(2(/c - l)7r + 37r/2) + / cos (u) u' ^ du 

J2fc7r-7r/2 
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cos(n)(u + 7r)~^ du 

2fc7r-7r/2 

= y(2(fc- l)7r + 3V2) 

/•2fc7r+7r/2 

+ / cos(n)(ii"'^ - (u + 7r)~'^)dit. 

J2/c7r-7r/2 

The integral is positive because the integrand is positive. This completes the 
proof of Lemma 5. □ 

Proof of Theorem 3. Write the Cholesky decomposition of Vo,n = 
LL' for some lower triangular matrix L. Let Q be an orthogonal matrix such 
that 

QL-iVi,„L'-iQ' = diag{4„, . . . , J. 

Then, 

Q'L'Vi-;,LQ = diag{l/a2 „, . . . , l/a^ „}. 
Taking the trace of both sides, we have 

n 

trace (VcnVj;;^) = ^ l/aln- 

i=l 

Hence, 

" / 1 \ 

EoiKi^l'n - Vo-;i)X„) = trace(Vo,„Vi;i) - n = ^ ( — - 1 ) . 
Let e„ = QL~^X„. Obviously, 



(B.13) Eoene',^ = In, £;ie„e'„ = diag{cri cr„^„}. 

Equation (3.7) follows if, for any orthogonal sequence {77^, k = 1,2, . . .} in 
the Hilbert space L|,(dPo) spanned by X{t),t £ D under the covariance inner 
product corresponding to Pq, there exists a constant M > independent of 
r]k,k = 1,2, ... , such that 



(B.14) 

k=l 



1 

1 



< M. 



One important technique to prove (B.14) is to write, for any s,t £ D, 
(B.15) EiX{t)X{s)-EoX{t)X{s)= [ [ e'^^'~''^'^'l>{X,fi)dXdfi, 

where ^{X,fi) is square integrable with respect to Lebesgue measure on R^. 
For any bounded region D, the existence of such a function $ and, therefore, 
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the equivalence of Pq and Pi, are shown in Ibragimov and Rozanov (1978), 
page 104, Theorem 17, under the assumption that the function /i(A) in (3.6) 
is square integrable. However, we will show, under the assumption of this 
lemma that h{\) is integrable, ^ takes a particular form 

(B.16) $(A,^)=¥^$2(/i) / e-^^^-^^-^dw 

JT 

for some functions $j(A), A G R such that / |<I>j(A)p//o(A) dA <oo, j = 1,2, 
and a compact interval T that is solely determined by ri and r2. This par- 
ticular form is central to the proof, and we will establish it at the end of 
this proof. We now proceed by assuming it is true. 

Let dZo(A) denote the stochastic orthogonal measure so that X{t) has the 
spectral representation under measure Pq; that is, X{t) = J exp{—iXt) (iZo(A). 
Then, for any r/ G L'jj{dPQ), there is a function (j){X) such that rj = J 0(A) dZo^X) 
and Eor]^ = J |0(A)|2/o(A) dA. We first show 

(B.17) Eirj^ = I |0(A)|Vi(A)dA, 

(B.18) Eiyf-Eorf = J Jj(X)^{^)^{X, n) dXdfi. 

Indeed, the two equations hold for r/ = X{t) = J exp(— zAt) dZo(A) for any 
t€D [assuming (B.15) is true]. Consequently, they hold for any linear com- 
bination 

J 

r, = Y,cjX{t,)= J 0(A)dZo(A) 

for any J and ti, . . . ,tj E D, where 0(A) = J2j=i Cje~^^*^ . 

For any 7] G L'j-,[dPQ), we can find a sequence of finite linear combinations 
of X{t),t G D, say, Tjm, m = 1, 2, . . . , such that limm->oo Eq^t] — r]m)^ = 0. If 
= / 0m (A) d^o(A), we have 

(B.19) So(?? - r]m? = j 10(A) - 0™,(A)p/o(A) dX ^ 0. 

Then, 

j 10(A) -0^(A)|2/i(A)(iA = j |,^(A)-0^(A)|2/o(A)(l + MA))dA^O, 

because /i = (/i — fo)/ fo is bounded. It follows that rjm converges in L^(dPi) 
norm to some variable fj because Ei{r]i — ??m)^ = / |0«(A) — 0m(A)P/i(A) dX — > 
as Z, m ^ oo. Then, 

Eifi^= hm Eiril= lim / |0„(A)|Vi(A) dA = / |0(A)| Vi(A) dA. 
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Since L2 convergence implies convergence in probability, we have r/^ — > fj 
in probability Pi. On the other hand, r/^ — > in probability Pq and, conse- 
quently, in probability Pi, due to the equivalence of the two probabilities. 
Then, we must have Pi(r/ = fy) = 1 and Eirf = Eijf . We have proved (B.17). 
To show (B.18), note that 

j j 4>m{\)(pm{lJ-)^{X,tJ')d\dfI- J j (j){X)(f){ll)^{X,fl)d\dn 



(B.20) < \{<PmiX)-H>^))<PM\mX,f^)\dXdfi 



+ \{ci)M-<Pip))HmH>^,f^)\dXdfi, 



where the first term tends to zero, because Cauchy-Schwarz inequality im- 
plies its square is bounded by 



\^m{x)-(t>{x)rMx)dx J \(t>M\'fo{fi)dfi 



/o(A) J /o(/i) 



by (B.19) and square integrability of <l>i(A)/-\//o(A), where and hereafter |r| 
stands for the length of finite interval T. Similarly, we can show the second 
term in (B.20) also tends to zero. Therefore, (B.18) is now proved by taking 
the limit of Eii]'^ — EqT]'^ and // 4>m{X)(j)m{lJ')^{X, ij,) dXdfi. 

Applying (B.18) to the orthonormal sequence r]k = J 0fc(A) dZo{X), k = l, 
2, . . . , we have 

Ei7ll-'^= I [M^Mi^) I e-'^^-^'^^ dw^JX)^2{fi)dXdfi 



T 



M,k{^)A2,k{^)duj, 

T 

where Aj^k{uj) = J ^^(A) exp(iAci;)$j (A) dA. Since (3.6) and continuity of fj 
imply that /i(A) > C/o(A) for some constant C > 0, we have Eirj'^ > CEqij^ = 
C and, therefore, 

\l/E^r^l-l\<\E^r,l-l\/C<{l/2C)Y^ j \A^,u{u^)\'' du. 

In view that Aj^k{uj) is the inner product of the two integrable functions 
0fc(A)/o(A)^/2 a^'d exp(iAa;)$j(A)//o(A)i/2 in L\dX), and that ,^fc(A)/o(A) ^2, 
= 1, 2, . . . , is an orthonormal sequence in L^{dX) [because -Eq^/^ = / ^^^A) x 
0fc(A)/o(A) dA], we have, by Bessel's inequality, 

00 „ 

El^i,fcHl'< / |^i(A)|V/o(A)(iA<cx). 
k=i ■' 
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It follows that 

oo 



fe=i j=i"'^fc=i 

<(|r|/2C)^ j |<^,(A)|V/o(A)dA<oo. 

We just need to show (B.15) and (B.16) to complete the proof. We will 
employ the following well-known properties of Fourier transform. For any 
square integrable functions (with respect to Lebesgue measure) ipj{X), A G 
W^, there are square integrable functions aj (t),t G M'^ such that 



(pj{\) = i exp(— «A't)aj(t) dt, 



Furthermore, 

(B.21) ipi{X)ip2{X)= I exp(-a't)(ai*a2)(t)dt, 

(B.22) / exp(iA't)v9i(A)(^2(A)(iA = (27r)'^(ai *a2)(t), 

where all the equalities are in the Lp'{dX) sense, and ai *a2 is the convolution; 
that is, 

ai*a2(t)= / ai(s)a2(t — s) (is. 

By Lemma 5, there exists a continuous and square integrable function 
(j = 1,2) such that 

(B.23) ^j{X) = J Cj{t)exp{-iXt)dt, < |^j(A)|2 x |A^'^^ A ^ oo, 

for some square integrable function Cj{t) that has a compact support [i.e., 
Cj(t) is outside a compact set]. 

Let C(A) = (/o(A) -/i(A))/|6(A)p. Then, ^(A) is square integrable by the 
assumption of the theorem and the properties of ^i(A). Therefore, we can 
write, for some square integrable function c(t), ^(A) = / exp(— zAt)c(i) dt. 
Furthermore, for all s,t, 



EoX{s)X{t) - EiX{s)X{t) = / e^^(^-*)(/o(A) - /i(A))dA 

(B.24) 

= y'e^^(^-*)e(A)|6(A)pdA, 
which we will denote by b{s,t). By (B.21), 

|^i(A)|2 = J exp{-iXt)(^J ci{z)ci{z - t) dz^ dt. 
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Applying (B.22) to ^(A) and |6(A)P, we get 



6(s,t) = 27r / c{w) / ci{z)ci{—{s — t — w — z))dzdw 

(B.25) 

= 2tt c{u — v)ci{s — u)ci{t — v) dudv, 

which holds for all s,t €M. If we restrict s,t to the compact set D, the 
integral (B.25) is an integral over a compact set, say, A x A. This is because 
ci is outside a compact interval. 

Next, we write c{t) as a convolution of two functions. For this purpose, 
we write ^(A) =^2(A)^3(A). Then, ^3(A) so defined is square integrable from 
assumptions and (B.23) and, therefore, can be written as 



e3(A)= J exp{-iXt)c3{t)dt. 
Then, = 02*03 and, consequently, 

(B.26) c{u — v) = / C2{x)c3{u — V — x) dx = / 02{u — uj)o3{u> — v) du). 



Since we are only interested in b(s,t) for s,t £ D and, consequently, only 
interested in o{u — v) for u,v € A, we will restrict both u,v to the interval 
A, so that the second interval in (B.26) is an integral on a finite interval, 
say, T, because 02 has a compact support. Define the bivariate function 



a{u,v) = / 02{u — uj)o3{uj — v) duj, u,vGB 
Jt 

which is square integrable because 

|a(n,f)p<|T| / |c2(n — a;)p|c3(a; — f )p 



and both 02 and 03 are square integrable. In addition, for u,v £ A, we have, 
from (B.26), 

a{u, v) = c{u — v). 
We therefore have shown that, for s,t£D, 

b{s,t) = 2-K / a{u,v)oi{s — u)oi{t — v) dudv. 

Note that the integral is a convolution of functions of (n, v). Applying (B.22), 
we get 

27r6(s,t)= / exp{i{\s + iJ.t))Lpi{X, fi)(p2{X, fJ') dXdfi, 
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where 



¥?i(A,/i) 

V32(A,/i) 



Clearly, 
Now, 



(/?i(A,/i) 



C2(2:)c3(-y)e-*«^'+'^)^+(j^+'^)^) dxdydcu 

{-y)e-'y^' dx dy^ e-^(^+^)'^ dto 



Hence, 
b{s,t) 



1 

2^ 



Um2WU-^^)U-^^) I e-'^^+^^'^dujdXdi, 

JT 

— / exp(i(As — /it))$i(A)<&2(M) / exp(— z(A — dtj dA 
27r Jm? Jt 



for c^i(A) = 6(A)e2(A) and cl>2(^) = Clearly, / |<I>, (A)| V/o(A) dX < 

oo by the assumption of the theorem, (B.23) and the square-integrability of 
^2 and ^3. The proof is complete. □ 

Proof of Theorem 4. Let aj be such that ajdj" = alOl^ , and let Pj 
be the probability measure under which the process has a Matern covari- 
ogram with parameters {9j,a'j) for j = 0, 1. Then, Pq = Pi by Theorem 2 
in Zhang (2004). Consequently, we only need to show that (3.8) and (3.9) 
hold, almost surely, with respect to Pi. 

Let /i(A) = /i(A; 6*1, erf ) be the spectral density under measure Pi and 
/2(A) the corresponding tapered spectral density as defined in Lemma 4, 
from which we see that, for some constant c> 0, 

/2(A)-/i(A)|2 



|Al>c 



/i(A) 



d\ < 00, 



30 



J. DU, H. ZHANG AND V. S. MANDREKAR 



which is a sufficient condition for the equivalence of the measures Pi and P2 
where P2 is the measure corresponding the tapered spectral density f2- 

Let Vj j = 1,2, be the covariance matrix corresponding to the spectral 
densities fj that depend on al and 9i and do not depend on o"^. For any o"^, 
we have 

(B.27) ^ 

= - log(det V2,„/det Vi,„) - ^X:,(V2;i - Vv'jXn. 



Split it into three additive terms as follows: 
detV2,n 



(B.28) 



log 



det Vi_„ 
2 



+ (i-^)i?i(xUV2-^-vi;;,)x, 



a 



a' 



(B.29) 

= /i+/2-/3- 

Because Pi = P2, the ffi'st term is bounded as we discussed previously in 
(B.3). Similarly, by (B.2), the third term I3 is bounded uniformly in cr^ G 
[WjU], almost surely. The second term I2 is also bounded uniformly in o"^ G 
[w,v] because, by Theorem 3, 

(B.30) Ei{Ki'^2'n - Vi-^)X„,) = 0(1). 

Therefore (3.8) is proved. To show (3.9), first observe 

(B.31) ^/.,tap(^l,^^)-^/ , , ^ 

which can be rewritten as 
,2 



i)<^^) — ~~t(XJ^V2 JjX„-Xj^V]^ ?jX„), 



(B.32) 



-gi?i(xuv2-i-vi-^)x„). 



Then, (3.9) immediately follows (B.4) and Theorem 3. □ 

Proof of Theorem 5. As erf = cr§(6'o/^i)^'^, we only need to show 

iV(0,2), 



(B.33) 
(B.34) 



~ 2 



•iV(0,2). 
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Let Vj^n be the covariance matrix of X„ corresponding to parameter values 
{ei,a^),j = 0,1. Write Vl n — fT^Ri^n, where R-i,n is the correlation matrix. 
First, we note that o"^ has a closed form express 

(B.35) al = ^K^^'n^n 

that can be derived straightforwardly from the maximization. Then, 



(B.36) 

= (1/ V^)(xUvi;i - Vo;! )x„,) + V^f ^"^"'"^" - 1 



n 



Since Vq^^X^ consists of i.i.d. A^(0, 1) variables, X^Vq^X„ is the sum of 
i.i.d. variables having a Xi distribution. The central limit theorem implies 
that the second term in (B.36) converges in distribution to A^(0,2). 

Equation (3.10) in Theorem 5 follows if the first term is shown to be 
bounded almost surely with respect to Pq. In view of (B.4), if suffices to 
show that 

Eo(xUvi;;,-Vo-^)x„) = o(i). 

To this end, we only need to verify that conditions of Theorem 3 are satisfied. 
The Matern spectral density (B.7) satisfies, as A — > oo, 

(B.37) 0</(A;^.,a2)~|A|-(2-+i). 

Moreover, in view of a^OQ" = alOi", 

(B.38) 

where /i(A) stands for f{X;6i,af), i = 0, 1. Using the Taylor expansion, we 
can get 

h{X) ~ |Ar^ 

Hence, (3.10) is proved. 

Next, we derive the asymptotic distribution of the tapered MLE a^^tg^p. 
Similar to a^, the tapered MLE 

tap takes the closed form 

where Ri,n is the tapered correlation matrix corresponding to Ri,n- It fol- 
lows (3.10) in Theorem 4 that 

^ri^l,n^n - ^'ri^ll^n = 0{l). 
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Then, 

<^n,tap = ^X^R^^X, + 0(l/n) = al + 0(l/n). 

It follows immediately that d^n,ta.p ^'^d have the same asymptotic distri- 
bution. The proof is complete. □ 

Acknowledgments. The authors thank the editors and two referees for 
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